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Abstract: We give the trade-off curve showing the capacity of a quantum channel as 
a function of the amount of entanglement used by the sender and receiver for transmit- 
ting information. The endpoints of this curve are given by the Holevo-Schumacher- 
Westmoreland capacity formula and the entanglement-assisted capacity, which is the 
maximum over all input density matrices of the quantum mutual information. The 
proof we give is based on the Holevo-Schumacher- Westmoreland formula, and also 
gives a new and simpler proof for the entanglement-assisted capacity formula. 

1 Introduction 

Information theory says that the capacity of a classical channel is essentially unique, 
and is representable as a single numerical quantity, giving the amount of information 
that can be transmitted asymptotically per channel use 1151 l4l. Quantum channels, 
unlike classical channels, do not have a single numerical quantity which can be defined 
as their capacity for transmitting information. Rather, quantum channels appear to 
have at least four different natural definitions of capacity, depending on the auxiliary 
resources allowed, the class of protocols allowed, and whether the information to be 
transmitted is classical or quantum. 

This paper will discuss the transmission of classical information over quantum 
channels. One of the first results in this area was an upper bound proved by Holevo 1 6 j 
on how much classical information could be transmitted by an ensemble of quantum 
states. Holevo |7| and Schumacher and Westmoreland 1 14| independently discovered 
proofs that this bound was achievable. This gives the theorem 

Theorem (Holevo; Schumacher and Westmoreland): The classical capacity ob- 
tainable using codewords composed of tensor products of signal states oi, where the 
probability of using Oi is pi, is 

x{{^^,Pi}) = HC^PiCTi) ~ ^piH{cri), (1) 

i i 

where H{p) — — Trplogp is the von Neumann entropy of the density matrix p. 

Note that xiWiiPi}) is a function of the probabilistic ensemble of signal states 
{o'i,Pi] that we have chosen, where state ai has pi. When it is clear what this ensemble 
is, we may simply denote this by %. 
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A memoryless quantum communication channel is a linear trace preserving com- 
pletely positive map. Such maps can be expressed as 



N{p) = Y.A,pAl 

i 

where the Ai satisfy = /. A natural guess at the capacity of a quantum 

channel M would be the maximum of x over all possible probability distributions of 
channel outputs, that is, the capacity would be 

Xmax(A/') - max x({A/'(cr»),p,}) (2) 



max HiNiYp^ai)) - Y p,H{N{a,)) 



since the sender can effectively communicate to the receiver any of the states Af{ai). 
This maximum can be achieved using pure states ai . This quantity is clearly achievable. 
We do not know whether this is the capacity of a quantum channel; this is reducible to 
the question of additivity of the quantity Xmax, 

Xmax (A/'2), (3) 



a question which has in recent years received much study 111 1131 [161 II II 1101 . If we 

require the protocols to send states that are tensor products on the different uses of 
the quantum channel, this is indeed the achievable capacity. However, if the use of 
entanglement between separate inputs to the channel helps to increase channel capacity, 
it would be possible to exceed this Xmax- The capacity of a quantum channel can be 
shown to be the regularized form of Eq. (|2j, that is, 

lim -Xmax(AA®"). (4) 
n — >oo Ji 

The next capacity we discuss is the entanglement-assisted capacity of a quantum 
channel (T TI. In the entanglement-assisted capacity, the sender and receiver share 
entanglement at the start of the protocol, which they are allowed to use in the com- 
munication protocol. The entanglement-assisted capacity is given by the following 
formula. 

Theorem (Bennett, Shor, Smolin, Thapliyal): The classical capacity obtainable 
using a quantum channel J\f is 



CE^ma.Ji H{p) + H(U{p))-H[{N®I){(l)p)] (5) 



p 



where (f>p is a state over the tensor product of the input space and a reference sys- 
tem, Tiin ® Href, whose reduced density matrix on the channel's input space is p, i.e., 
Tr2(/>p = p. 

The amount of pure state entanglement consumed by the protocol given in 3 can 
be shown to be asymptotically H{p) ebits per channel use, where p is the density matrix 
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maximizing Eq. (|5|l, and an ebit is the amount of pure state entanglement in an EPR 
pair of qubits. 

This naturally leads to several questions. Is H{p) ebits per channel use the amount 
of entanglement required to achieve the entanglement-assisted capacity? More gen- 
erally, if the amount of entanglement available is P < H{p) ebits per channel use, 
how much classical information can be transmitted? We answer these questions in the 
following theorem. 

Theorem 1. If the available entanglement per channel use is restricted to P ebits, 
there is a protocol achieving the information rate given by 

max y^p,H{pi) + H{N{y^p^pi)) ~y^p^H{{N ®I){4>p,)) 
subject to '^piH{pi) < P, (6) 

i 

where Tr20p; = Pi. Here, the maximization is over all probabilistic ensembles of 
density matrices {pi,Pi} where pi G Tiin, J2iPi — '^"'^ average entropy of the 
ensemble, ^iPiH{pi), is at most P. 

In the case where P = 0, this gives the Holevo capacity Xmax of Eq. 0, as the 
Pi must all be pure states. In the case where P is sufficiently large, this gives the 
entanglement-assisted capacity of Eq. (jSj- Since we do not know whether the 
Holevo capacity is additive, we clearly cannot show that the above capacity trade-off is 
additive; this is an open question. We can however prove that this formula is an upper 
bound if we restrict ourself to protocols where the sender and receiver start by sharing 
pure entangled quantum states, and the sender is not allowed to distribute one of these 
entangled states among more than one channel use, the same restriction under which we 
know the Holevo capacity x is the correct formula for unassisted classical capacity. To 
get the true capacity trade-off formula (if it is not additive), we may have to regularize 
this formula. That is, to take the limit of the normalized entanglement-assisted capacity 
for the channel A/^®" as n goes to infinity. 

This theorem can also be derived using the methods of f5\. However, we give a 
quite different and somewhat simpler proof than in |5 J for the trade-off formula, as 
well as a simpler proof than j3| for the entanglement-assisted capacity. This proof 
relies on the Holevo-Schumacher- Westmoreland theorem above, so in this paper we 
are showing that knowing the left endpoint of this trade-off curve lets us derive the 
entire curve. 

2 The Protocol 

We now give the protocol that asymptotically achieves the capacity (|6j. We use block 
coding. We will let n be the number of channel uses in our block coding protocol. This 
protocol will take n entangled states and use them as the input for these channel uses. 
It will not distribute one of these entangled states over more than one channel use, but 
it will permute the entangled states before sending them through the channels, so the 
mapping of the entangled states to the channel inputs depends on the message being 
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sent. Suppose that the maximum of Eq. (|6} occurs at the ensemble {pi,pi}. We assume 
that the sender and the receiver start by sharing a number n of entangled states where 
there are ^ npi states for which the reduced density matrix is pi. For the proof 
that our protocol achieves its desired capacity, we will use the Holevo-Schumacher- 
Westmoreland theorem with nl 2('^^i)" signal states, where d is the dimension of the 
input space to the channel. These signal states are described as follows. 

First, Alice applies to her part of the state | (j>p.) a random sign change ±1 to the 
phase of each of the eigenvalues of pi. Note that there are 2"^^^ possible phase changes 
for each of the states | (f>p^), as Alice can without loss of generality apply the phase 
+ 1 to the first eigenvalue (since an overall phase change does not alter the quantum 
state). Next, Alice applies a random permutation to the n entangled states she shares 
with Bob. Since there are n! permutations, we have n\ 2"(''^i) signal states total. 

Before we can continue, we need a lemma. 

Lemma 1. Suppose we have n density matrices, pi, p2, ■ ■ ., Pn, which are drawn 
at random from some probability distribution on density matrices. Then 



lim -E 

n— >-oo 77, 



= Hip) (7) 



where the sum is over all nl permutations n of the n density matrices, the expectation 
E is over the random choice of pi ... p„, and p is the average density matrix for the 
probability distribution that the pi are drawn from. 

That the left hand side of Eq. (0 is at most the right hand side follows immediately 
from the subadditivity of entropy of quantum states. The proof of the other direction 
will be deferred until later. 

The proof of Theorem 1 is slightly nicer if we let the rii be random variables ob- 
tained by drawing n density matrices from a distribution where pi occurs with prob- 
abiUty pi. In other words, instead of Alice and Bob starting each coding block with 
exactly rii copies of the entangled state | (t)p. ), they use the next n states in a sequence 
of shared states where | 4)p. ) occurs with probability pi. It is not hard to prove that the 
protocol also works when they start with exactly k, npi states, although we will not 
prove this in the paper. 

We now look at the signal states more carefully. The first term in the Holevo capac- 
ity X, Eq. Q, is the entropy of the average output signal received by Bob. This signal 
consists of two parts, the quantum state A which was originally held by Alice, and was 
subsequently modified and sent through n uses of the channel Af, and the quantum 
state B, which Bob originally held and has kept. 

The random phase change applied by Alice disentangles Alice and Bob's entangled 
states \(j>i). We will work in the basis of the eigenvalues of pi. Let these eigenvalues 
be I Vij). In this basis, | (pp. ) = ^/Xij \ Vij) \ Vij). After the random phase change, 
the density matrix is J2j ^ij I I ^ I I ■ This is the same density mati'ix as 

is given by the ensemble containing the state | Vij ){vij \ (g) | Vij | with probability 
Xij . Let us assume then that Alice and Bob started by sharing n unentangled quantum 
states, each of which was in the state | ) | Vij ) with probability piXij . We will bound 
the entropy of Bob's average signal state by using this second ensemble, which must 
give the same answer, as the entropy depends only on the density matrix. What we do 
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is add an extra, classical, variable, which we denote by T. We let T tell us the type 
class of the distribution; that is, the variable T holds the numbers jii and the numbers 
rriij, where niij = rii and where rriij tells how many of these quantum systems 
started in the state | Vij) \ Vij). The reason we do this is that after Alice applies the 
phase changes and the random permutation to her quantum states, if we condition on 
T the quantum states Alice and Bob hold are now independent. This is because Alice 
inputs into the channel a mixture of all permutations of the n states consisting of rriij 
copies of I Vij) for each i, j, and this mixed state is determined solely by T. By entropy 
inequalities and the definition of conditional entropy. 

H{A)+H{B)>H{AB) > H{ABT) - H{T) 

= H{AB\T) 

= H{A\T) + H{B\T) (8) 

= H{AT) + H{BT) - 2H{T) 

> H{A)+H{B) -AH{T). 

However, since H{T) — 0(log n), we need only estimate H{A) and H{B) to compute 
the asymptotics of H{AB). We have 

H{B) ^Y.n,H{p,) ^nY,P^H{p^). (9) 

i i 

The state A is a mixture of all permutations of the density matrices M{\ Vij){vij |), 
where M{ \ Vij){vij |) occurs with probability PiXij, so by Lemma 1, 

H{A) « nH{N{p)) (10) 

where p = J^iPiPi- Thus, the first term of the HSW formula, H{AB), is approxi- 
mately 

H{AB) « H{A)+H{B) « n (^p^Hip,) + H{MiJ2p,p,))^ ' ^^^^ 

Finally, we look at the second term in the Holevo capacity Eq. ([l). This is the 
entropy (A/"®" ® /)(| ^■k.p){^it,p \), where | <i>7r,p) is the signal state her half of which 
Alice inputs into the channel. This state was produced by Alice first performing a 
random phase change P in the eigenbasis of pi, to her half of all of ther quantum states 
I (j>pi), and then applying a random permutation tt to all n of her states. It is easy to 
check that if Bob knows what these random phase changes and permutation were, he 
can undo them. Thus, all nl 2^'^^^^" signal states give rise to the same joint entropy, 
which is J2i niH(^{I ®Af){ \ 4>pi){<i'pi I)) - This is the last term of Eq. (|6}. We thus have 
a protocol that asymptotically achieves Eq. (|6}. 
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3 Proof of the Lemma 



We now prove the following lemma, which will imply Lemma 1 . 

Lemma 2. Suppose that W6 hQve 71 dsnsity ffiQtrices p\, p2, ■ ■ Pn- Let 

IT 

be the expected entropy of the average ofk of these density matrices chosen randomly 
without replacement from the n density matrices. Then 

V^' TT ) k=\ 

Proof: We let be a variable which gives the values of the images of the first k 
elements of the permutation vr: 7r(l), 7r(2), . . ., 7r(fc). Then 



fc+i 



^(^TiE^/'-o)) -^(^E(8)^-o)) (14) 



TT j=l ' ^ TT j = l 



where 7r|Tfc is the set of permutations which have their first k elements fixed by T^. The 
inequality above is an application of the strong superadditivity property of quantum 
entropy. Now, by adding the left hand sides of the above expression (I14> for k between 
and n — 1, we obtain a telescoping series which gives the left hand side of Eq. ( I13> . 
Adding the right-hand side of Eq. J14> for k between and n — 1 gives the right hand 
side of Eq. ( I13> . proving the lemma. 

Suppose now that pi, pi, . . ., p„ are matrices drawn identically and independently 
from some probability distribution. The above lemma implies that 



E 



>J2^H{pk) (15) 

k=l 



where now note that EH{pk) is the expected entropy of the average of k density ma- 
trices drawn from the probability distribution. But for a finite dimensional quantum 
space, 'EH{pk) is easily seen to converge to H{p), where p is the average density 
matrix of the probability distribution. This completes the proof of Lemma 1 . 
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4 The upper bound. 



What we do now is show an upper bound on the capacity of a quantum channel assisted 
by limited entanglement of P ebits per channel use, subject to the proviso that Alice 
cannot input a state entangled over more than one channel use. We will assume the 
following scenario. Alice and Bob start with a set of pure entangled states {(pi), where 
we define pi = Tr^ | <j>i){<j>i |. We let Alice perform an arbitrary unitary transformation 
on her states, and then send part (or all) of the resulting state through the channel. We 
do not let Alice input the same | 0j) into more than one channel use. We will use the 
Holevo bound to bound the information that Bob can receive using such a protocol. 

We will first start with the assumption that Alice's part of their shared state occupies 
a Hilbert space with the same dimension as the channel input, i.e., dim pi — dim Hiu- 
Now, Alice will perform a unitary transformation to obtain the state UjPiUj, and send 
it through the channel. We now use the Holevo bound, Eq. Q, to bound the capacity 
Alice and Bob can achieve using such a protocol. Again, Bob's signal consists of the 
state he received from Alice through the channel together with the state that he kept. 
By the subadditivity of entropy, the first term of Eq. Q is bounded by the entropy of 
the average output of Ahce's channel plus the average entropy of the reduced states 
held by Bob. If Uj pi C/| is sent with probability pij , then this first term is bounded by 

Y,p.,H{p,) + H{Y^p.,,M{U,p.U])), 

which is the same as the first two terms in the formula (|6}, assuming that we used the 
ensemble {Uj piUj , pij} in formula (|6|l. The second term of the Holevo bound Q is 
also identical in these two scenarios. Specifically, it is 

J2PnH(^iAf^I){{Uj(g>I) I 1 (L/j®/))) = ;^p,,i/((A/-®/)(T,j)), (16) 

ij 

U p u''- /\'Pu-p U^ ^ purification of UjPiUj . Thus, if Alice applies 
unitary transformations to pij and inputs the entire resulting state into the channel, she 
cannot achieve a capacity better than that given by Theorem 1 . 

We now show that the same bound applies if Alice is allowed to put only a part 
of her quantum state through the channel. That is, Alice and Bob share an entangled 
pure state | (pi) where Tr^ | 4ii){(j)i | = G Hm ^ Ti-rd- AUce puts Hin through the 
channel, and discards Tircf ■ We will compare the capacity achieved by this case with 
that achieved by an alternative scenario. If Alice measures the reference system Hici in 
the basis determined by the eigenvalues of TrjnPi, she obtains a probability distribution 
Pij over quantum states pij e Hin- We will show that replacing pi with the ensemble 
of states {pij,Pij} increases the capacity, while decreasing the amount of pure state 
entanglement consumed by the protocol. 

Holevo's bound shows that the capacity of such a protocol can be at most 

^Ki7(pO+^(A^(^Tr,rfP,)) +E^''^((-^®^)(Trrcf |0p,)(0p. |)), (17) 



where 
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where 4>p^ is the joint pure state of AHce and Bob. If all the pi are in TYjn (so we need 
no reference system Hrd), then illi gives the same capacity as (|6j. 

Let the output of the channel be denoted by A, and let the reference system after 
Alice's measurement (which now contains the classical variable j telling which of the 
measurement outcomes was obtained) be R. Finally, let the part of the entangled state 
I (j>i) held by Bob be B. The replacement of pi by the ensemble {pij,Pij} does not 
change the average output of the channel, so the second term of ( I17> giving the entropy 
of the average output of the channel is unchanged. The contribution to the first and 
third terms from the state pi is proportional to 

H{B) - H{AB). (18) 

Replacing pi by the ensemble {py , pij } gives a contribution proportional to 

{H{BR) - H{R)) - {H{ABR) - H{R)) . (19) 

This second contribution M9\ is larger than the first ( I18> by the property of strong 
subadditivity of quantum entropy. 

It is also easy to see that the amount of pure state entanglement consumed by the 
protocol decreases after the replacement of pi by {pij, }, since Alice and Bob can 
obtain the ensemble of states {pij.Pij} from the state pi using solely local quantum 
operations and classical communication (LOCC operations), and these never increase 
the amount of entanglement. We thus see that the assumption that all of Alice's part 
of the entangled states was sent through the channel did not impose any restrictions on 
channel capacity. 

Finally, let us note that if Ahce takes her parts of two different entangled pure states 
and sends them through one channel use, this also cannot increase the capacity of the 
protocol. To see this, note that this case has essentially already been taken into account 
in our analysis, as the tensor product of the two pure entangled states can be considered 
as a single entangled state. Thus, the only case this is not covered by our analysis is 
when Alice's channel inputs are entangled over more than one channel use. This case 
is discussed briefly in the next section. 

5 Discussion 

We have given a formula that tells how much the classical capacity of a quantum chan- 
nel can be increased by the use of a limited amount of entanglement between the sender 
and receiver, which is consumed by the protocol for transmitting information. This pa- 
per is quite different in approach than the paper |5 1, which also gives a proof for this 
trade-off curve. It also yields a simpler proof of the original entanglement-assisted 
capacity formula |3J. 

It is not known whether we need to regularize the trade-off formula to find the 
capacity. In light of the recent discovery that many of the additivity problems in quan- 
tum information theory are equivalent 1 16|, a natural question is to ask whether this 
is equivalent to these other problems. We have not been able to show this, although it 
is clearly at least as hard, since additivity of the Holevo capacity, which is one of the 



8 



equivalent problems, is the special case of the trade-off curve when no entanglement is 
consumed. 

Finally, let us note that in order to achieve the capacity formula without Alice 
using inputs entangled between different channel uses, it appears that Alice and Bob 
need to be able to start by sharing arbitrary pure entangled states | (pp^), and that it is not 
sufficient for Alice and Bob to start by sharing solely EPR pairs. However, since pure 
state entanglement is an interconvertible resource 1 12 1, if we remove the restriction on 
Alice sending states entangled between different channel uses, then we can use EPR 
paks for the shared entanglement consumed by the protocol. 
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